# Instruction Fine-tuning

Kakaocorp.kanana 1.5 8b Instruct 2505 GGUF
Kanana-1.5-8B-Instruct-2505 is an 8B-parameter instruction fine-tuned language model developed by Kakao Corp, suitable for text generation tasks.
Large Language Model
K
DevQuasar
483
1
Marin Community.marin 8b Instruct GGUF
marin-8b-instruct is an 8B-parameter-scale instruction fine-tuned language model suitable for text generation tasks.
Large Language Model
M
DevQuasar
343
1
Allenai.olmo 2 0425 1B Instruct GGUF
OLMo-2-0425-1B-Instruct is a 1-billion-parameter instruction-finetuned language model developed by AllenAI, focused on text generation tasks.
Large Language Model
A
DevQuasar
220
1
Olmo 2 0425 1B Instruct GGUF
Apache-2.0
OLMo 2 1B Instruct Edition is a post-training variant of the OLMo-2-0425-1B-RLVR1 model, optimized through supervised fine-tuning, DPO training, and RLVR training to achieve state-of-the-art performance across multiple tasks.
Large Language Model English
O
unsloth
3,137
3
Josiefied Qwen3 4B Abliterated V1 Gguf
Apache-2.0
This is the GGUF quantized version of the Josiefied-Qwen3-4B-abliterated-v1 model, suitable for local deployment and execution.
Large Language Model
J
Goekdeniz-Guelmez
4,518
7
Olmo 2 0425 1B Instruct
Apache-2.0
OLMo 2 1B is a post-training variant of the allenai/OLMo-2-0425-1B-RLVR1 model, undergoing supervised fine-tuning, DPO training, and RLVR training, aiming to achieve state-of-the-art performance across multiple tasks.
Large Language Model Transformers English
O
allenai
5,127
33
Stablelm Zephyr 3b GGUF
Other
StableLM Zephyr 3B is a 3-billion-parameter instruction-tuned model trained on public datasets, synthetic datasets, and Direct Preference Optimization (DPO), delivering excellent performance.
Large Language Model English
S
brittlewis12
51
1
Gemma 2 9b It Abliterated GGUF
A quantized version based on Gemma 2.9B, optimized using llama.cpp, suitable for running in LM Studio.
Large Language Model English
G
bartowski
3,941
37
Badger Writer Llama 3 8b
Badger Writer is a normalized Fourier task superposition model based on multiple Llama 3 8B models, specializing in text generation tasks, particularly excelling in creative writing and instruction following.
Large Language Model Transformers
B
maldv
106
10
Gemma 2 Llama Swallow 27b It V0.1
A Japanese-enhanced large language model based on the Gemma-2 architecture, significantly improving Japanese capabilities while retaining original English proficiency
Large Language Model Transformers Supports Multiple Languages
G
tokyotech-llm
27
1
Gemma 2 Llama Swallow 2b It V0.1
The Gemma-2-Llama-Swallow series is built through continued pre-training of the gemma-2 model, significantly enhancing Japanese language processing capabilities while retaining original English proficiency.
Large Language Model Transformers Supports Multiple Languages
G
tokyotech-llm
61
1
Olmo 2 0425 1B
Apache-2.0
OLMo 2 1B is the smallest model in the open language model series released by the Allen Institute for Artificial Intelligence, based on OLMo-mix-1124 pre-training and further trained with the Dolmino-mix-1124 dataset during the intermediate training phase.
Large Language Model Transformers English
O
allenai
13.31k
45
Videochat R1 Thinking 7B
Apache-2.0
VideoChat-R1-thinking_7B is a multimodal model based on Qwen2.5-VL-7B-Instruct, focusing on video-text-to-text tasks.
Video-to-Text Transformers English
V
OpenGVLab
800
0
Multilingual E5 Large Instruct Q8 0 GGUF
MIT
Multilingual E5 large instruction model, supporting text embedding and classification tasks in multiple languages with strong cross-language capabilities.
Large Language Model Supports Multiple Languages
M
Gomez12
90
1
R01 Gemma 3 1b It
Gemma 3 is a lightweight open-source multimodal model introduced by Google, built on the same technology as Gemini, supporting text and image inputs to generate text outputs.
Text-to-Image Transformers English
R
EpistemeAI
17
1
JEE 14B
This model is a text generation model fine-tuned based on Qwen2.5-14B-Instruct and trained using the TRL library.
Large Language Model Transformers
J
ruh-ai
475
4
Toastypigeon Gemma 3 Starshine 12B GGUF
A creative writing model based on Gemma 3 12B, excelling in narration and scene construction with a novelistic style
Large Language Model English
T
ArtusDev
223
2
Allura Org Gemma 3 Glitter 4B GGUF
GGUF format model file converted from allura-org/Gemma-3-Glitter-4B, optimized with imatrix quantization
Large Language Model English
A
ArtusDev
69
1
Doge 320M Instruct
Apache-2.0
Doge 320M Instruct is a lightweight language model based on dynamic masked attention, trained with supervised fine-tuning (SFT) and direct preference optimization (DPO), suitable for question-answering and dialogue tasks.
Large Language Model Transformers English
D
SmallDoge
12.61k
3
Thedrummer Fallen Gemma3 4B V1 GGUF
Other
This is a quantized version of TheDrummer/Fallen-Gemma3-4B-v1 model, processed using llama.cpp, suitable for text generation tasks.
Large Language Model
T
bartowski
2,106
3
Mistral Small 3.1 24b Instruct 2503 Hf
Apache-2.0
Mistral Small 3.1 Instruct 24B is a large language model based on instruction fine-tuning, focusing on text generation tasks.
Large Language Model Transformers
M
mrfakename
9,416
9
Qwen2.5 Bakeneko 32b Instruct V2
Apache-2.0
An instruction-tuned variant based on Qwen2.5 Bakeneko 32B, enhanced with Chat Vector and ORPO optimization for improved instruction-following capabilities, excelling in Japanese MT-Bench.
Large Language Model Transformers Japanese
Q
rinna
140
6
Teacher Persona GGUF
Qwen2-1.5B-Instruct is a 1.5 billion parameter instruction fine-tuned large language model released by Alibaba Cloud, suitable for Q&A and dialogue tasks.
Large Language Model
T
RyZhangHason
24
1
T3Q Qwen2.5 14b V1.2 E2
Apache-2.0
T3Q-qwen2.5-14b-v1.2-e2 is a post-trained version based on the Qwen/Qwen2.5-14B-Instruct-1M model, using LoRA-8-4-0.0001-cosine-32-16 configuration and trained on train_data_v1.2.
Large Language Model Transformers Supports Multiple Languages
T
JungZoona
119
8
T3Q Qwen2.5 14b V1.0 E3 Q4 K M GGUF
Apache-2.0
This is a quantized model based on Qwen2.5-14B-Instruct-1M, converted to GGUF format, suitable for the llama.cpp framework.
Large Language Model Supports Multiple Languages
T
Sangto
1,126
4
Gemma 3 12b Novision
A text-only version converted from google/gemma-3-12b-it, with visual components removed, focusing on text generation tasks
Large Language Model Transformers
G
gghfez
86
2
Google.gemma 3 4b It GGUF
Gemma 3.4B IT is a 3.4 billion parameter large language model developed by Google, focusing on the instruction-tuned version, suitable for various natural language processing tasks.
Large Language Model
G
DevQuasar
141
1
Traceback 12b
Apache-2.0
TraceBack 12b is a 4bit quantized version based on the Mistral-Nemo-Instruct architecture, focusing on instruction-following and chain-of-thought reasoning tasks.
Large Language Model Transformers
T
secemp9
1,470
29
Llama 3.1 8b Medusa V1.01
An 8B-parameter language model based on the Llama 3.1 architecture, created by merging multiple specialized models, excelling in text generation tasks.
Large Language Model Transformers
L
Nexesenex
95
3
Kanana Nano 2.1b Instruct
Kanana is a bilingual (Korean/English) language model series developed by Kakao. This 2.1B parameter version outperforms similar models in Korean while maintaining efficient computational costs.
Large Language Model Transformers Supports Multiple Languages
K
kakaocorp
5,994
59
Hiber Multi 10B Instruct
Hiber-Multi-10B-Instruct is an advanced multilingual large language model based on Transformer architecture, supporting multiple languages with 10 billion parameters, suitable for text generation tasks.
Large Language Model Transformers Supports Multiple Languages
H
Hibernates
86
2
Huihui Ai.qwen2.5 14B Instruct 1M Abliterated GGUF
A 14B-parameter large language model focused on instruction-following tasks, supporting text generation capabilities.
Large Language Model
H
DevQuasar
550
1
Nousresearch DeepHermes 3 Llama 3 8B Preview GGUF
A dialogue model fine-tuned based on Llama-3-8B, supporting multiple quantization versions, suitable for tasks such as chatting, reasoning, and role-playing.
Large Language Model English
N
bartowski
1,038
16
Guardreasoner 1B
Other
GuardReasoner 1B is a version fine-tuned via R-SFT and HS-DPO based on meta-llama/Llama-3.2-1B, focusing on classification tasks for analyzing human-AI interactions.
Large Language Model Transformers English
G
yueliu1999
154
4
Guardreasoner 8B
Apache-2.0
GuardReasoner 8B is a fine-tuned model based on meta-llama/Llama-3.1-8B, specializing in reasoning-based LLM safety protection
Large Language Model Transformers
G
yueliu1999
480
2
Mistral Small 24B Instruct 2501 GGUF
GGUF quantized version of Mistral-Small-24B-Instruct-2501, suitable for local deployment and text generation tasks.
Large Language Model
M
MaziyarPanahi
474.73k
2
Deepseer R1 Vision Distill Qwen 1.5B Google Vit Base Patch16 224
Apache-2.0
DeepSeer is a vision-language model developed based on the DeepSeek-R1 model, supporting chain-of-thought reasoning and trained through dialogue templates for visual models.
Image-to-Text Transformers
D
mehmetkeremturkcan
25
2
Lake 1 Advanced
MIT
Mistral-7B-Instruct-v0.3 is a large language model fine-tuned for instruction following based on Mistral-7B-v0.3, supporting function calls and extended vocabulary.
Large Language Model
L
BICORP
62
2
Lava Phi
MIT
A vision-language model based on Microsoft's Phi-1.5 architecture, combined with CLIP for image processing capabilities
Image-to-Text Transformers Supports Multiple Languages
L
sagar007
17
0
Videochat TPO
MIT
A multimodal large language model developed based on the paper 'Task Preference Optimization: Improving Multimodal Large Language Models through Visual Task Alignment'
Text-to-Video Transformers
V
OpenGVLab
18
5
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase